A high-resolution glottal pulse tracker

Authors

  • Robert D. Rodman
  • David F. McAllister
  • Donald L. Bitzer
  • D. Chappell
Abstract

A new method of computing the glottal pulse period of voiced speech is given. This algorithm is based on the mathematically derived fact that the amplitudes of the odd harmonics of a periodic function with period P are zero when the function is expanded in a Fourier series whose coefficients are determined by integrating over 2P instead of the usual P. It is shown that such a glottal pulse tracker is extremely sensitive to sudden short-lived shifts in the apparent frequency of the glottal pulse that circumscribe certain consonants in the speech stream. This method may therefore be used to segment these consonants for various analytical purposes.
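The property stated in the abstract can be checked numerically. The sketch below is not the authors' algorithm. It is a minimal illustration that assumes a synthetic, exactly periodic pulse train with an integer period in samples, and a brute-force search over candidate periods. The function names `odd_harmonic_energy` and `estimate_period`, the pulse shape, and the search range are all hypothetical choices made for this example.

```python
import cmath
import math


def odd_harmonic_energy(x, q, n_harmonics=5):
    """Sum of |c_k|^2 over the first few odd k, where c_k is the Fourier
    coefficient computed over a window of 2*q samples (twice the candidate
    period q). If x really has period q, every odd-k coefficient vanishes."""
    n = 2 * q
    energy = 0.0
    for k in range(1, 2 * n_harmonics, 2):  # k = 1, 3, 5, ...
        c = sum(x[m] * cmath.exp(-2j * math.pi * k * m / n) for m in range(n)) / n
        energy += abs(c) ** 2
    return energy


def estimate_period(x, q_min, q_max):
    """Candidate period in [q_min, q_max] with the least odd-harmonic energy.
    Assumption: the range contains the true period but none of its integer
    multiples, since a multiple of the period also zeroes the odd harmonics."""
    return min(range(q_min, q_max + 1), key=lambda q: odd_harmonic_energy(x, q))


# Synthetic "glottal-like" signal: a decaying pulse repeated every 50 samples.
TRUE_PERIOD = 50
signal = [math.exp(-5.0 * (m % TRUE_PERIOD) / TRUE_PERIOD) for m in range(200)]

estimated = estimate_period(signal, 30, 80)
print("estimated period:", estimated)
print("odd-harmonic energy at true period:", odd_harmonic_energy(signal, TRUE_PERIOD))
print("odd-harmonic energy at wrong period:", odd_harmonic_energy(signal, 47))
```

Because the odd-harmonic energy is zero only when the window spans exactly two periods (or an even multiple of one), a brief shift in the apparent period makes it jump away from zero. This is consistent with the sensitivity the abstract describes, although real speech is only approximately periodic and would need a tolerance rather than an exact minimum.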

Similar resources

Speech Communication Session 5aSCa: Flow, Structure, and Acoustic Interactions During Voice Production I 5aSCa6. Acoustic coupling during incomplete glottal closure and its effect on the inverse filtering of oral airflow

Inverse filtering of oral airflow using closed-phase linear prediction is expected to preserve the effects of source-filter interactions in the glottal airflow pulse. Under incomplete glottal closure, the glottal airflow estimation is more challenging due to a lowered glottal impedance, increased subglottal coupling, and violated all-pole assumption. To account for these effects, a model-based ...

Application of the bispectrum to glottal pulse analysis

Higher order spectral (HOS) techniques, such as the bispectrum, offer robustness to Gaussian noise and the ability to recover phase information. However, their drawbacks, such as the high variance of estimates and the need for long data records, have limited their use in conventional speech processing applications. As in glottal pulse estimation, all existing inverse filtering approaches use se...

An Algorithm for V/UV/S Segmentation of Speech

Let f(n) be a sampled voice signal. Our goal is to identify the voiced (V) portions of f (as opposed to the silence (S) and unvoiced (UV) portions). In the following discussion, the sampling rate is 22050 Hz, quantized at 8 bits. A window of length of 880 is twice the maximum period of the minimum frequency of 50 Hz we will track in the time domain. We use the algorithm described below to provi...
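The window length quoted in this excerpt follows from simple arithmetic, sketched below. The constant names are illustrative. The exact two-period length comes out to 882 samples, so the stated 880 is presumably a rounded figure (an assumption, since the rest of the excerpt is truncated).

```python
SAMPLE_RATE_HZ = 22050   # sampling rate given in the excerpt
MIN_F0_HZ = 50           # lowest tracked frequency given in the excerpt

# Longest period to track, in samples: one cycle of the lowest frequency.
max_period_samples = SAMPLE_RATE_HZ // MIN_F0_HZ

# A window spanning two such periods.
window_samples = 2 * max_period_samples

print(max_period_samples, window_samples)
```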

The perceptual relevance of glottal-pulse parameter variations

The perceptual relevance of changes to glottal-pulse parameters is studied. First, it is demonstrated that a distance measure based on excitation patterns can predict audibility discrimination thresholds for small changes to the R parameters of the Liljencrants-Fant (LF) model. Next, by using this measure the perceptual relevance of the LF parameters is quantified. Results are presented for a n...

The GlottHMM Entry for Blizzard Challenge 2011: Utilizing Source Unit Selection in HMM-Based Speech Synthesis for Improved Excitation Generation

This paper describes the GlottHMM speech synthesis system for Blizzard Challenge 2011. GlottHMM is a hidden Markov model (HMM) based speech synthesis system that utilizes glottal inverse filtering for separating the vocal tract and the glottal source from speech signal and models both components individually. In this year’s entry, stabilized weighted linear prediction (SWLP) is used to yield mo...


Publication date: 2000